Locally differentially private high-dimensional data synthesis

Authors

Abstract

In local differential privacy (LDP), a challenging problem is the ability to generate high-dimensional data while efficiently capturing the correlation between attributes in a dataset. Existing solutions for low-dimensional data synthesis, which partition the privacy budget among all attributes, cease to be effective in high-dimensional scenarios due to the large-scale noise and communication cost caused by the high dimension. In fact, the characteristics of high-dimensional data not only bring challenges but also make it possible to apply some technologies to break this bottleneck. This paper presents SamPrivSyn for high-dimensional data synthesis under LDP, which is composed of a marginal sampling module and a data generation module. The marginal sampling module is used to sample from the original data to obtain two-way marginals. The sampling process is based on mutual information, which is updated iteratively to retain, as much as possible, the correlation between attributes. The data generation module is used to reconstruct the synthetic dataset from the sampled two-way marginals. Furthermore, this study conducted comparison experiments on real-world datasets to demonstrate the effectiveness and efficiency of the proposed method, with results proving that SamPrivSyn can not only protect privacy but also retain the correlation information between attributes.
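The abstract says that marginal sampling is guided by mutual information between attribute pairs. As a rough illustration of that quantity only, the sketch below computes mutual information from a two-way marginal table and ranks attribute pairs by it. It is not the SamPrivSyn procedure: the LDP perturbation and the iterative update are not shown, and the names `mutual_information` and `rank_pairs` are hypothetical helpers introduced here.

```python
import math


def mutual_information(joint_table):
    """Mutual information (in nats) of a two-way marginal.

    joint_table is a list of rows of counts or probabilities.
    Assumption: negative cells, which noisy estimates can produce,
    are clipped to zero before normalizing.
    """
    clipped = [[max(cell, 0.0) for cell in row] for row in joint_table]
    total = sum(sum(row) for row in clipped)
    if total == 0:
        return 0.0
    p = [[cell / total for cell in row] for row in clipped]
    px = [sum(row) for row in p]        # marginal of the row attribute
    py = [sum(col) for col in zip(*p)]  # marginal of the column attribute
    mi = 0.0
    for i, row in enumerate(p):
        for j, pij in enumerate(row):
            if pij > 0:
                mi += pij * math.log(pij / (px[i] * py[j]))
    return mi


def rank_pairs(marginals):
    """Sort attribute pairs by mutual information, most correlated first.

    marginals maps a pair such as ("age", "income") to its two-way table.
    """
    return sorted(
        marginals,
        key=lambda pair: mutual_information(marginals[pair]),
        reverse=True,
    )
```

A pair whose table factorizes into its one-way marginals scores zero, so a ranking of this kind would favor pairs whose joint distribution carries information beyond the individual attributes.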


Related resources

Optimizing Locally Differentially Private Protocols

Protocols satisfying Local Differential Privacy (LDP) enable parties to collect aggregate information about a population while protecting each user’s privacy, without relying on a trusted third party. LDP protocols (such as Google’s RAPPOR) have been deployed in real-world scenarios. In these protocols, a user encodes his private information and perturbs the encoded value locally before sending...


Locally Differentially Private Heavy Hitter Identification

The notion of Local Differential Privacy (LDP) enables users to answer sensitive questions while preserving their privacy. The basic LDP frequency oracle protocol enables the aggregator to estimate the frequency of any value. But when the domain of input values is large, finding the most frequent values, also known as the heavy hitters, by estimating the frequencies of all possible values, is co...


Locally Differentially Private Protocols for Frequency Estimation

Protocols satisfying Local Differential Privacy (LDP) enable parties to collect aggregate information about a population while protecting each user’s privacy, without relying on a trusted third party. LDP protocols (such as Google’s RAPPOR) have been deployed in real-world scenarios. In these protocols, a user encodes his private information and perturbs the encoded value locally before sending...


Differentially Private Synthesization of Multi-Dimensional Data using Copula Functions

Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Most of the existing techniques that generate differentially private histograms or synthetic data only work well for single-dimensional or low-dimensional histograms. They become problematic for high-dimensional and large-domain data due to increased perturbation error and c...


Adaptive Differentially Private Histogram of Low-Dimensional Data

We want to publish low-dimensional points, for example 2D spatial points, in a differentially private manner. Most existing mechanisms publish noisy frequency counts of points in a fixed predefined partition. Arguably, histograms with adaptive partitions, for example V-optimal and equi-depth histograms, which have smaller bin widths in denser regions, would provide more statistical information. H...



Journal

Journal title: Science China Information Sciences

Year: 2022

ISSN: 1869-1919, 1674-733X

DOI: https://doi.org/10.1007/s11432-022-3583-x